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Abstract 

Suppose that a real valued process X is given as a solution to 
a stochastic differential equation. Then, for any twice continuously 
differentiable function /, the backward Kolmogorov equation gives a 
condition for f(t,X) to be a local martingale. 

We generalize the backward equation in two main ways. First, it 
is extended to non-diffcrcntiable functions. Second, the process X is 
not required to satisfy an SDE. Instead, it is only required to be a 
quasimartingalc satisfying an integrability condition, and the martin- 
gale condition for f(t,X) is then expressed in terms of the marginal 
distributions, drift measure and jumps of X. The proof involves the 
stochastic calculus of Dirichlet processes and a time- reversal argument. 

These results are then applied to show that a continuous and strong 
Markov martingale is uniquely determined by its marginal distribu- 
tions. 



1 Introduction 

Suppose that we have a real valued process X satisfying a stochastic differ- 
ential equation of the form 

(1.1) dX t = a(t,X t )dWt + b(t,X t )dt 

where W is a Brownian motion. The uniqueness of solutions to this stochas- 
tic differential equation in the case where a and b are Lipschitz continuous 
is well known ([9] Theorem 7.2). In fact, as we are just considering the case 
of one dimensional diffusions, the Yamada-Watanabe theorem ([9] Theorem 
40.1) says that the Lipschitz condition for a can be relaxed to a Holder 
condition of order 1/2. When a > then a result of Stroock and Varadhan 
in [11] shows that weak uniqueness holds as long as a is continuous and b is 
locally bounded. 



Key Words: diffusion, strong Markov, martingale, quasimartingale, Dirichlet process. 
AMS 2000 Classification: 60J60, 60J25, 60G44, 60H10. 



2 



GEORGE LOWTHER 



An alternative approach is to consider the backward Kolmogorov equa- 
tion (or Feynman-Kac formula). Given a twice continuously differentiable 
function f(t,x), this says that f(t,Xf) is a local martingale if 

Q 1 Q 

(1.2) -f^ x) + -a\t, x)-g^f(t, x) + b(t, x)fof(t, x) = 0, 

which is a simple consequence of Ito's formula. Suppose that a and b are 
sufficiently well behaved so that for any T > and smooth bounded function 
g(x), there exists a bounded solution to (1.2) for t < T and satisfying the 
boundary condition f(T,x) = g{x). Then, f(t,X t ) would be a martingale 
and consequently, 

(1-3) f(t,X t )=E[g(X T )\T t }. 

This shows that X is Markov and determines its transition probabilities. So, 
the joint distribution of X is uniquely defined once Xq is known. Conversely, 
given a solution X to (1.1) we could define / by equation (1.3) and attempt 
to show that it is twice continuously differentiable, so it would satisfy the 
backward equation. Unfortunately this is not possible in general, in which 
case uniqueness of SDE (1.1) can fail. 

The aim of this paper is prove a generalization of the backward equation. 
Firstly, we generalize to include non-differentiable functions /. Secondly, we 
shall extend it to a much more general class of processes, which do not have 
to satisfy any stochastic differential equation such as (1.1). In fact, we shall 
extend it to all quasimartingales satisfying an integrability condition. In 
that case the diffusion coefficients a and b might not exist, and equation 
(1.2) is replaced by one involving the marginal distributions, drift measure 
and jump components of X. 

This result is applied to the example of continuous and strong Markov 
martingales, showing that they are uniquely determined by their marginal 
distributions. Throughout this paper, we work with a filtered probability 
space (O, J 7 , (J r t ) t6 K + , P), and the strong Markov property is defined as fol- 
lows. 

Definition 1.1. A real valued process X is strong Markov if for every 
bounded, measurable g : R — > R and every t > there exists a measurable 
f : R + x R -> R such that 

f(T,X T )=E[g(X T+t )\F T ] 

for every finite stopping time t. 

We now state the uniqueness result for continuous martingales. 

Theorem 1.2. Let X and Y be continuous and strong Markov martingales 
such that, for every t E R + , Xt and Y± have the same distribution. Then, 
X and Y have the same joint distribution. 
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The proof that this follows from our generalized backward equation is 
given in Section 2. The question of existence of continuous martingales with 
given marginals will be studied in a future paper. 

Let us briefly discuss the idea here in the case where X is a martingale, 
so the coefficient b in SDE (1.1) is zero. Then, let C : R+ x R -> R be the 
function 

(1.4) C(t,x)=E[(X t -x) + }. 

Clearly, C(t,x) depends only on the distribution of Xf. Furthermore, it 
is easy to see that knowing C is equivalent to knowing all the marginal 
distributions of X. If A" has smooth probability densities, the following 
follows from the Fokker-Planck (or forward Kolmogorov) equation 

(1.5) ^c(t,x) = ^a 2 (t,x)^C(t,x). 

This equation is well known in finance, where it is used to calibrate the 
local volatility model to observed option prices (see [2]). We note that it 
is possible to combine this with (1.2) in order to eliminate a, giving the 
following martingale condition for f(t,Xf), 

dj_&C_ dCd 2 f _ 
dt dx 2 dt dx 2 

Note that, although we have removed any reference to SDE (1.1), the equa- 
tion is still only defined when / and C are twice continuously differentiable. 
The idea now is to smooth this expression by multiplying by a twice differ- 
entiable function with compact support in (0, oo) x R and integrating. To 
this end, make the following definition, 

So /i[/,c](^) is a linear function of 6, and is symmetric in exchanging / and 
C. Integration by parts gives 

fffdfdcde dedfdc dcd6df\ 
(i.8) n [f ,cm = JJ {^^m~dx'di~^~'^dx'm) dtdx 

and the martingale condition for f(t,Xt) can be written as H[f,c] = 0- I n 
order for the right hand side of (1.8) to be well defined, / and C only have 
to be once differentiable. In fact, we can go further and remove the first 
order derivatives with respect to t, replacing them with Lebesgue-Stieltjes 
integrals. For every i£l such that f(t,x) is right-continuous with locally 
finite variation in t, the Lebesgue-Stieltjes integral f ■ ckf(t, x) is (locally) a 
finite signed measure satisfying 

h 

dtf(t, x) = f(ti,x) - /(t , x). 

to 
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In particular, if / is continuously differentiable in t then dtf = (df /dt) dt. 
Furthermore, for the right hand side of (1.8) to be well defined, it is only 
necessary for the partial derivatives with respect to x to exist in an almost- 
everywhere sense. In particular, equation (1.4) implies that C(t,x) will be 
convex in x, so its partial derivatives exist almost everywhere, and Jensen's 
inequality implies that it will be increasing in t. So, expression (1.8) is well 
defined even if C is not smooth. Similarly if / is defined by (1.3), g is convex, 
and X is a continuous and strong Markov martingale then f(t,x) will be 
convex in x and decreasing in t (see [7]). Then, as we shall see, this leads to 
a well-defined expression for fi^ C j(9). 

Although the above argument gives us reason to believe that fJ>[f t c] = is 
a valid martingale condition for f(t,Xt), it only proves it in the case where 
/ and C are smooth, and X satisfies the SDE (1.1). The general case is 
much more difficult, because results such as Ito's formula can't be applied 
directly to non-differentiable functions. Proving that //r^cn = is a sufficient 
condition for f(t,Xt) to be a martingale is the hardest direction, because 
we do not even know a priori that it is a semimartingale. This will require 
the results of [6] which show that f(t,Xt) is a Dirichlet process, so can be 
decomposed into local martingale and zero continuous quadratic variation 
terms. A time reversal argument is applied in Sections 7 and 8 to show that 
the zero continuous quadratic variation term has finite variation. 

The analysis above can be extended to more general jump diffusions 
given by a time dependent generator 



where b, a, j are sufficiently well-behaved functions. Again, it is possible to 
eliminate a from the forward and backward equations. We do not go through 
the details here as it is the same in principle to the above argument except 
that there are additional terms due to the drift b and jump component j. 
The following martingale condition for f(t,Xt) is then obtained, 



where pt(x) = C 22 (t,x) is the probability density of X t and Jt(y,z) is 



Jt{y,z) =Pt(y)j(t,y,z) / (f(t,x) - f(t,z) + {z- x)f i2 (t,x))0 i2 (t,x)dx. 



J y 

All of the terms in expression (1.10) can be defined for general quasimartin- 
gales, as shown in Section 3, and will give the required martingale condition. 



(1.9) 




(1.10) 
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2 Continuous Martingales 

In this section we state the martingale condition for f(t,Xt) in the case 
where X is a continuous and strong Markov martingale, and show how 
it implies that such processes are uniquely determined by their marginal 
distributions. As discussed above, the martingale condition will be of the 
form H[f£]{Q) = which, for continuously differentiable functions, is defined 
by expression (1.8). We will extend this definition to the following class of 
non-differentiable functions for 9, f and C. Here, f^ \dtf(t,x)\ represents 
the variation of / with respect to t over the interval [0, T\. 

Definition 2.1. Denote by T> the set of functions f : R+ x R — ► R such that 

• f(t,x) is Lipschitz continuous in x and cadlag in t, 

• for every Kq < K\ € R and T € R+ then 



• the left and right derivatives of f(t,x) with respect to x exist every- 



We also define T>k to consist of those functions f € T> with compact support 
in (0, oo) x R. 

Except for the global Lipschitz condition, which is adopted here for con- 
venience, this class of functions is the same as that used for the decompo- 
sition results in [6]. Note that if f(t,x) is convex and Lipschitz continuous 
in x, and right continuous and monotonic in t then f & T>. In particular, 
C(t, x) defined by (1.4) is in T> whenever X is a right-continuous martingale. 
Furthermore, if / 6 T> then f(t,x) will have locally finite variation in t for 
almost every x, and the integral ff • dtf(t,x) dx is well defined and locally 
gives a finite signed measure. 

Let us now introduce use a bit of simplifying notation. Given any func- 
tion / : M + xl^l where its arguments occur as dummy variables in an 
expression, then we shall often omit them. In this case we shall always use 
s, t or u for the first argument and x or y for the second. So, for example 
ff fdxdt is the same as ff f(t,x)dxdt. We shall use /~ to represent the 
left limit 




\dtf(t, x)\dx < oo, 



where. 




Furthermore, when we write / (t, x) then we shall take this to mean the 
derivative of f~(t,x) with respect to x. 
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As was noted earlier, in order for expression (1.8) for M[/,C] to be well 
defined, the partial derivatives of f(t,x) and C(t,x) with respect to x are 
required to exist in an almost-everywhere sense. Fortunately this is indeed 
the case for functions in T>. 

Lemma 2.2. If f,g G V, then f(t,x) and f~(t,x) are differentiable in x 
almost everywhere with respect to the measure JJ ■ \dtg(t,x)\dx. 

Proof. By Lemma 3.3 of [6], f(t,x) is differentiable in x almost everywhere 
with respect to the measure JJ ■ \d t g(t,x)\ dx. We just need to extend this 
to / _ . Note that as f(t,x) is jointly continuous in x and cadlag in t, there 
can only be countably many times t at which f~ ^ f. So, by countable 
additivity of the measure JJ -\dtg(t,x)\dx, we can restrict to fixed times 
T > 0. As f~(T,x) is Lipschitz continuous in x, Lebesgue's theorem says 
that it will be differentiable almost-everywhere with respect to the Lebesgue 
measure (see [5] Theorem 3.2). So, 

J J l{t=r}l{/-(t,z) is not differentiable in x} x ) \ dx 

= J l{/-(T,x) is not differentiable in x} \d(T, ~ 9{T—,x)\dx = 

as required. □ 



This allows ^^q\{6) to be extended to all f,C € T> and 9 G £>k- 
Definition 2.3. For every f,g £ T> define the linear map : £>k 



V[J,g\ ^ = JJ h g ' 2 dt9 dx ~JJ 9 ' 2 b dt9 dx ~JJ 9 ' 2 9 ' 2 dt ^ dx ' 

In particular, if / and g are twice continuously differentiable this defini- 
tion coincides with that given by equation (1.8), and locally defines a 
signed measure with 

/ x , fdfd 2 g dgd 2 f\ , , 

(2 - 1} d ^ = {mdx^ + d-td^) dtdx - 

Then, the martingale condition will be Mf^ci = 0> where C is defined by 
(1.4). In fact, we will go a bit further than this and show that /J>[f t c] deter- 
mines the drift component of f(t,Xt). 

We now define the drift measure of f(t,Xt). Here, the terminology local 
signed measure on M + x R is used to mean a real valued function on the 
bounded Borel subsets of R+ x R, which is a finite signed measure when 
restricted to any bounded Borel set. 



A GENERALIZED BACKWARD EQUATION 



7 



Definition 2.4. Let X be a real valued and adapted cddldg process. Then, 
we write T>(X) for those functions f S T> such that the following decompo- 
sition exists, 

(2.2) f(t,X t ) = M t + A t . 

Here, M is a cddldg local martingale and A is a cddldg previsible process 
of locally finite variation with Aq = 0, and such that f l{( s ,x s _)eS} dA s has 
integrable variation for every bounded Borel subset S of R+ x R. We then 
define the local signed measure by 

vf(e) = E 

for every bounded measurable 9 : R+ x R — > R with bounded support. 

Note that if decomposition (2.2) exists then it is unique. If f(t,Xt) = 
M t + A t = M[ + A' t were two such decompositions then M — M' = A' — A 
would be a previsible local martingale with locally finite variation, and so is 
constant. Therefore M = M' and A = A'. So, fij is well defined. 

We would like to express the martingale property of f(t,X t ) as fij = 0. 
Certainly, it is clear that this condition will be satisfied whenever f(t,Xt) 
is a martingale, because then A = in decomposition (2.2). However, the 
converse is not obvious, and the following result will be required. Recall 
that a process X is said to be quasi-left-continuous if X T _ = X T (a.s.) for 
all previsible stopping times r > 0. 

Theorem 2.5. Let X be a cddldg and quasi-left- continuous strong Markov 
process, f G V{X), and f(t,Xt) = Mf + At be decomposition (2.2). Then, 
for every non-negative and measurable 9 : R+ xK->l, 

\l*f\(9)=E 

The proof of this is left until Section 8. Here we are using the notation 
\fi\ for the variation of a local signed measure \i, 

|/x|(0) = sup fi(g9) 
\g\<i 

where the supremum is taken over all bounded and measurable g with \g\ < 
1. In particular, if fij = then Theorem 2.5 shows that A is constant, so 
f(t,Xt) is a local martingale. 

We now state the main result for functions of continuous martingales, 
namely that fiwcf] describes the drift of f(t,Xt). 

Theorem 2.6. Let X be a continuous and strong Markov martingale and 
define C E T> by equation (1.4)- If f E"D then the following are equivalent. 



J 9(t,X t -)dA t 



J 9(t,X t -)\dA t 
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• There exists a local signed measure fj> such that A*[/,c] (^) = / or 
all 6 G £> K - 

Furthermore, if these conditions are true then [i = fij. 

The proof of this is again left until Section 8. In the remainder of this sec- 
tion, we show how Theorem 2.6 implies that continuous and strong Markov 
martingales are uniquely determined by their marginal distributions. The 
first step is the promised martingale condition [i\f t c} = 0) which is our gen- 
eralization of the backward Kolmogorov equation (1.2). 

Theorem 2.7. Let X be a continuous and strong Markov martingale and 
f G T>. Then f(t,Xf) is a martingale if and only if Wf i( jj(9) = for all 

Proof. First, suppose that f(t, Xt) is a martingale. Then we can take A = 
in decomposition (2.2), and it follows that / G T)(X) and fij = 0. Theorem 
2.6 gives fx [f>C ](0)=nf(0-) = 0. 

Conversely, suppose that = 0- Taking fi = in Theorem 2.6 shows 

that / G V(X) and nj = 0. If f(t, X t ) = M t +A t is decomposition (2.2) then 
Theorem 2.5 says that A has zero variation, so A = and f(t, Xt) = Mt is 
a local martingale. As X is a martingale and f(t,x) is Lipschitz continuous 
in x, it follows that for every t > 0, f(r,X T ) is uniformly integrable over all 
stopping times r < t. So, f(t,X t ) is a proper martingale. □ 

In order to complete the proof that continuous and strong Markov mar- 
tingales are uniquely determined by their marginal distributions, we require 
the following result, which follows from Theorems 1.5 and 1.6 of [7]. 

Theorem 2.8. Let X be a continuous and strong Markov martingale, g : 
K — > K be convex and Lipschitz continuous, and choose any T > 0. Then 
there exists an f : M + xM^R such that f(t,x) is convex and Lipschitz 
continuous in x, right- continuous and decreasing in t and 

(2.3) f(t,X t )=E[g(X T )\F t ] 

for all t <T . 

In particular, / G T>. The proof of Theorem 1.2 now follows easily. 

Proof of Theorem 1.2. Note that for any T > 0, the stopped processes X T 
and Y T are continuous and strong Markov martingales. Define C G T> by 
C(t,x) = E [(Xf — x)+] and choose any convex and Lipschitz continuous 
g : R — ► R. Then choose / G T> such that equality (2.3) is satisfied for 
every t < T. By definition, f(t, Xf) is a martingale, so Theorem 2.7 gives 
M[/,C] = 0. 
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As X and Y have the same marginal distributions, C is also given by 
C(t, x) = E [(Y^ — x) + ] . So Theorem 2.7 says that f(t, Y^) is a martingale. 
In particular, equality (2.3) is satisfied with Y in place of X, showing that 
and (Yt,Yr) have the same distribution whenever t < T. As X 
and Y are required to be Markov and have the same initial distributions, 
they must have the same joint distributions. □ 

3 Quasimartingales 

In the previous section the martingale condition fJ*\f t c] = was described 
for functions of continuous and strong Markov martingales, and it was 
shown that this implies that such processes are uniquely determined by 
their marginal distributions. We now turn to the more general case where 
the process X need not be either continuous or a martingale. The idea here 
is to use a generalization of (1.10). Note that this equation involves the 
drift b and jump rate j of the process, so it will be necessary to restrict to 
processes for which a generalization of these terms can be defined. For that 
reason, we look at quasimartingales. 

However, the results stated in this section will not be as strong as for 
the continuous martingale case. In particular, we will not extend Theorem 
1.2 to quasimartingales. This is a subject which will be taken up in more 
detail in a later paper. 

The simple previsible processes are defined to be linear combinations 
of processes of the form Alj^r where T > and A is an .F^-measurable 
random variable. Then, the definition of quasimartingales used in this paper 
is as follows. 

Definition 3.1. Let X be a real valued stochastic process such thatM [\Xt\] < 
oo for every t € M + . Then, define 

: £ is simple previsible and |£| < ll . 

Then, a quasimartingale X is a cddldg adapted process such that E [\Xt\] < 
co and Var x (t) < co for every t 6 M+ . 

This definition of quasimartingales is slightly more general than that 
used in [4], [8] and [9]. This is because we are only really interested in the 
properties of processes over finite time horizons, and Definition 3.1 is the 
same as saying that the stopped process X 1 is a quasimartingale according 
to the usual definition for every finite time t. 

We also require a uniform integrability property for the processes. If 
{X T : r is a stopping time} is uniformly integrable, then the process X is 
said to be of class (D). As we are only interested in finite time horizons, the 
following definition will be used. 



Varx(t) = sup < E 



idX 
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Definition 3.2. A process X is of class (DL) if the stopped process X 1 is 
of class (D) for every t € R+. 

If C(t,x) is defined by equation (1.4), then it is in T> whenever X is a 
class (DL) quasimartingale. So, A*[/,c] defined in the previous section is also 
defined for quasimartingales. 

Lemma 3.3. Let X be a quasimartingale of class (DL) and set C(t,x) = 
E [{X-t — x)+] ■ Then, C(t, x) + Varx(i) is an increasing function oft and 
C G V. 

Proof. If t > s then Jensen's inequality gives 

C(t,x)=E[(X t -x) + ]>E[(E [X t \T a ]-x)+] 

> E l(X s - x)+] - E [|E [X t - X S \F S }\] 

> C(s, a;) - Varx(t) + Var x (s) 

Therefore, C(t,x) + Varx(t) is increasing in t. So, 

/ \d t C(t,x)\dx < / (C(T,x) + 2Var x (T)) dx < oo. 

K JO JK 

As (Xt — x)+ is convex in x it follows that C(t,x) is also convex in x. 
Finally, if we choose t n 1 1 then (Xt n —x) + is a uniformly integrable sequence 
converging to (X t — x) + . So, C(t n ,x) — > C(t,x). This shows that C(t,x) is 
right-continuous in t, so C E P. □ 

It can be shown that this result remains true even without the class (DL) 
assumption, although that is not required here. Restricting to quasimartin- 
gales allows us split X up into a martingale part and a drift component, 
using the following version of Rao's decomposition. 

Lemma 3.4. If X is a quasimartingale then it has the unique decomposition 
(3.1) X = M + A 

where M is a local martingale and A is a cadlag previsible process with 
locally finite variation and Aq = 0. Furthermore, for every t > 0, 



E 



|<L4| 



o 



< Var x (t) 



and, if X is of class (DL), M is a martingale. 

Proof. First, Rao's Theorem ([8] III Theorem 15) says that decomposition 
(3.1) exists and is unique. 

Choose any T > and consider the quasimartingale If = Xj — E [Xj 1 !^]. 
As Yf = for t >T, the definition of \arx(T) above agrees with the condi- 
tional variation Var(Y) used in [4] VIII Section 2. So Rao's decomposition 
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([4] Theorem 8.13) gives Y = Z 1 — Z 2 for non- negative supermartingales 
Z X ,Z 2 satisfying E \_Z\ + Z$] = Var x (T). By the Doob-Meyer decompo- 
sition ([8] III Theorem 13), Z l = N* - B l (i = 1,2) where N { are local 
martingales and B % are increasing previsible processes with Bq = 0. So, the 
uniqueness of decomposition (3.1) gives A T = B 2 — B l and 



E 



f \dA\] <E[B^ + Bl] < E [Z\ + Zl\ = Var x (T). 
Jo 



Finally, if X is of class (DL) then so is M = X — A, in which case M is a 
martingale. □ 

Decomposition (3.1) allows the drift measure of X to be defined as fol- 
lows. 

Definition 3.5. Let X be a quasimartingale and X = M + A be decom- 
position (3.1). Then, define the drift measure of X to be the local signed 
measure fix given by 



POD 

/ 6{t,X t -)dA t 
Jo 



for all bounded and measurable 9 : R + xM^R with bounded support. 

The drift measure \ix enables equation (1.10) to be extended to more 
general processes simply by replacing the pt(x)b(t, x) dx dt term by d/ix (t, x). 
It remains to extend the jump component Jt(y, z), which is done as follows. 

Definition 3.6. Let X be a cddldg process. For 9, f E T> define 

(3.2) J t X (9,f)= (f(t,x) - f(t,X t ) + (X t - x)f-(t,x)) 9-(t,x)dx. 
Jx t - 

This allows equation (1.10) to be rewritten in a form that makes sense 
for all class (DL) quasimartingales. In order for this expression to be well 
defined, it is necessary for the sum of the jump terms J*(0, f) over all times 
t to be integrable. This is the CclSG, clS will be shown in Lemma 4.2. So, 
generalizing the left hand side of (1.10) gives the following definition. 

Definition 3.7. Let X be a class (DL) quasimartingale. Then, for every 
f £ T> define the linear map jlj : X>k —* R, 



(if (9) = pt [f , C ](0) + f [ 9-(t,y)f~(t,y)dydfi x (t,x)+K 
where C £ V is defined by C(t, x) = E [(X t — x)+]. 



t>o 
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In particular, if X is a continuous martingale then the drift and jump 
terms above are zero, so fij = ^[f,c]- Using this definition, we now state the 
main result of this paper for quasimartingales. Recall that fif is the drift 
measure of f(t,Xt) introduced in Definition 2.4. Note that here we make 
no requirement for X to be a Markov process. 

Theorem 3.8. Let X be a class (DL) quasimartingale and f E T>(X). Then 
nf(d~) = flf(d) for all 9eV K . 

The proof of this is left until Section 6. In particular, if f(t,Xt) is a 
martingale then it follows that / is in T>(X) and [ij = 0. So Theorem 3.8 
gives jlj = 0, and represents a general form of the backward equation (1.2). 
Conversely, if / € D(X) and jlj = this shows that fij = and, if X is also 
strong Markov and quasi-left-continuous, Theorem 2.5 implies that f(t,Xt) 
is a martingale. However, if we only know that jlj = then this does not 
allow us to infer that / is in T>(X), so Theorem 3.8 cannot be applied. 

The reason that we do not obtain as strong results for quasimartingales 
as in the continuous martingale case is that, for conditional expectations of 
convex functions of X, Theorem 2.8 allowed us to restrict attention to / G T>. 
This is required in the proofs of Theorems 1.2 and 2.6. Unfortunately, this 
result applies only to continuous martingales. In a later paper we will look at 
conditions on the process X under which / can be approximated sufficiently 
well by functions in T>(X), in order to extend the results for continuous 
martingales to processes with non-zero drift. 

4 Calculating fij For Smooth Functions 

We start the proof of Theorem 3.8 by looking at the case where / is twice 
continuously differ entiable. This is much simpler than for general functions 
in V, because Ito's formula can be applied to explicitly decompose f(t,Xt) 
into local martingale and finite variation terms. 

First, as the quadratic covariation [X, Y] of semimartingales X and Y is 
a cadlag process with locally finite variation whose jumps satisfy A[X, Y] = 
AX AY, its continuous part can be written as 

(4.1) [X, Y] c t = [X, Y] t - AX S AY S . 

s<t 

We also write [X] c = [X, X] c for the the continuous part of the quadratic 
variation of X. Then, the following lemma will be used to evaluate terms of 
the form E [J 9(t, X t ) d[X] c ] . 

Lemma 4.1. Let X be a quasimartingale of class (DL). Define C £ V by 
C(t, x) = E [{Xt — x) + ] and let fix be the drift measure of X. Then, for any 
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bounded and measurable 



(4.2) 
(4.3) 
and 

(4.4) 



E 
E 



E 

,t>o 



: R + xI->M with bounded support, 

\e(t,x t )\ d[xf t <oo, 

x t 

(X t - x) \9(t, x)\ dx 

x t - 



9d t Cdx = -E 
2 



+ E 



E 

l_i>0 



9(t,X t )d[X]$ 

"X t 



+ 



6(t,y)dydnx(t,x) 



(X t - x)6(t,x)dx 



x t - 



Proof. By linearity, it is enough to prove this for non-negative 0, for which 
equation (4.4) follows from substituting the definition of \xx into Lemma 4.3 
of [6]. As C G T> (Lemma 3.3) and \ix is finite on finite time intervals, the 
left hand side and the second term on the right hand side of (4.4) are finite. 
Then, as the remaining terms on the right hand side are non-negative, they 
must also be finite. □ 

Inequality (4.3) shows that the sum of the jump terms Jt(6,f) used in 
Definition 3.7 is integrable, so fij is well defined. 

Lemma 4.2. Let X be a class (DL) quasimartingale, f E T> and 6 6 £>k- 
ThenE[J2 t>0 \Jt X (0J)\} < oo. 

Proof. If \f >2 \ <L, 



J t x (o,f) 



x t 



X t - 
X, 



((f(t,x) - f(t,X t ) + (X t -x)f-(t,x)) 9~(t,x)dx, 



< / (X t -x)(L+\f-\)9-dx, 



X, 



and inequality (4.3) gives the result. 



□ 



Lemma 4.1 can be used together with Ito's formula to prove Theorem 
3.8 when / is continuously differentiable, as we now show. Here we restrict 
to functions / with compact support. This is just for simplicity, and in any 
case it will be extended to general / S *D(X) later. 

Lemma 4.3. Let X be a class (DL) quasimartingale and f 6 "Dk be twice 
continuously differentiable. Then f G T> (X) and [ij(0~) = ftj(0) for all 

8eV K . 
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Proof. Let X = M + A be decomposition (3.1) and apply Ito's formula, 



f(T,X T ) = N T + V T 

rT 



N T = f(0,X )+ [ f i2 dM t , 
Jo 

Vt = \! f, 22 d[Xf t + [ f, 2 dA t + [ f^dt + J^J'i 

z Jo Jo Jo t<T 

J° = f(t,X t )-f-f j2 AX t = [ X 

Jx t 



{X t - x)f t22 (t,x)dx. 



Here, for brevity, we have omitted the arguments for all functions whenever 
they are (t, Xt—). As A has integrable variation over finite time intervals, 
inequalities (4.2) and (4.3) show that V has integrable variation, so / is in 
T>(X). It follows from the expression above for V that, for any 9 6 2?k> 



2 



0-f, 2 ,d[X] c t 



J E[9-f a ]dt + n x (6-f, 2 )+E 



t>0 



By substituting 9 f a2 in place of 9 in equation (4.4), we can eliminate [X] c 
from the above to get 



=P+Q+E 

where P, Q and j\ are as follows, 

P= [[ 6~f i22 d t Cdx + J E [9~f tl 



t>o 



dt 



Q = nx{0 f, 2 



'(t,y)f,22(t,y)dydij,x{t,x) 



9 2 (t,y)f t2 (t,y)dydfj, x (t,x) 

J, 1 = 0-(t,X t -)J? - (X t - x)0-(t,rE)/ 22 (t,x) fix 
Jx t _ 

(X t -x)(0-(t,Xt-) - 9-(t,x))f t3a (t,x)dx 

>X t - 

Here, we have applied an integration by parts to Q, and applying a further 
couple of integration by parts to J\ gives J\ = (9, /). It only remains to 
be shown that P = /ir*^^). We use the following identity which applies to 
all g,h € £>k 



g dthdx 



h 2 dtg dx. 
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If either g or h is twice continuously differentiable then this follows from in- 
tegration by parts. More generally, it is given by [6], Lemma 3.1. Substitute 
9f 2 in place of g and C in place of h, and again apply integration by parts, 



(e~u 2 +e-f a )d t cdx = J J c 2 dt(6f t2 )dx 

f t2 C 2 d t 0dx + ft 6C 2 f t21 dtdx 



f >2 C 2 d t 6dx -JJ 6 a C 2 f a dtdx-JJ 0f a d x C 2 dtdx. 
So, as required, Definition 2.3 of M[/,C7] (^) gives 

vim (*) = // d ~f,22 d t c dx+ ff e-f a d x c 2 dt = p. □ 



5 Dirichlet Processes 

As we saw in the previous section, the identity fij(6~) = p,J(8) follows 
from Ito's formula when / is twice continuously differentiable. However, 
this method cannot be applied to non-differentiable functions. Instead, we 
use the results of [6], where it was shown that f{t,X t ) can be decomposed 
into a local martingale and a term whose quadratic variation exists and has 
zero continuous part. We now look at some of the properties of these Dirich- 
let processes, which will be required in the remainder of this paper. Such 
processes form a generalization of semimartingales, and allow decomposition 
(2.2) to be generalized to all /eP. 

We start with the definition of the quadratic variation for general cadlag 
processes. A (stochastic) partition P of R+ is a sequence of stopping times 
= To < T\ < T2 < ■ ■ ■ such that r n — > oo as n — > oo. For any partition P 
we write the sequence it defines as , so 

= To < t[ < r 2 p < • • • t oo. 

Use \P\ to denote the maximum step size of P, 

I Dl II P P II 

fceN 

Here |[C7"|| 00 is the smallest positive real number that is almost surely an 
upper bound for \U\. Then, for cadlag processes X, Y the approximation 
[X, Y] p to the quadratic covariation along a partition P is 

oo 

(5.1) [X, Y]f = ]T(X t P At - X tLiAt )(Y t P At - Y t ,_ iAt ). 

k=l 

The quadratic covariation is then defined as the limit of [X, Y] p as the 
partition step size \P\ goes to zero. This limit is taken in the topology of 
uniform convergence on compacts in probability (ucp for short). 
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Definition 5.1. Let X, Y be real valued cadlag processes. Then their 
quadratic covariation [X,Y], if it exists, is a cadlag process such that 

[X,Y] P [X,Y] 

as \P\ — > 0. Given any cadlag process X then its quadratic variation [X], if 
it exists, is defined to be [X] = [X, X]. 

From uniform convergence, the jumps of [X, Y] must satisfy A[X, Y] = 
AX AY. Whenever [X], [Y], [X,Y] all exist then [X,Y] = [X + Y]/2 - 
( [X] + \Y] )/2 is a difference of increasing processes and so has locally hnite 
variation. As with the semimartingale case, the continuous part of the 
covariation, [X,Y] C , is defined by (4.1), and [Xf = [X,X] C . 

We shall say that V is a zero continuous quadratic variation process 
(z.c.q.v. process for short) if it is cadlag, adapted and its quadratic varia- 
tion exists and satisfies [V] c = 0. In particular, every cadlag locally finite 
variation process has zero continuous quadratic variation (see, for example, 
[8] II Theorem 26). The following result was proven in [6] Lemma 1.5. 

Lemma 5.2. Let X be a cadlag process whose quadratic variation exists 
and V be a z.c.q.v. process. Then, the covariation [X, V] exists and satisfies 

[X,V] t = Y,^X s AV s . 

s<t 

This is equivalent to saying that [X, V] c = and it follows that the class 
of z.c.q.v. processes is closed under taking linear combinations. Furthermore, 
if A is a previsible z.c.q.v. process and X is a semimartingale, 

(5.2) [i,I] ( =VAi s AI s = I AAdX. 

s<t J ° 

In the special case where X is a local martingale, this shows that [A, X] will 
also be a local martingale. We use the following definition. 

Definition 5.3. A process X is a Dirichlet process iff it has the decompo- 
sition X = M + V for a local martingale M and a z.c.q.v. process V . 

Such processes were introduced by Follmer in [3], although only con- 
tinuous processes were considered. The definition was extended to non- 
continuous processes in [10] and [1], where a Dirichlet process is defined as 
the sum of a semimartingale and a process with zero quadratic variation. 
Here we are using the more general definition from [G] where the zero contin- 
uous quadratic variation component is also allowed to be non-continuous. In 
particular, a semimartingale is a sum of a local martingale and a locally fi- 
nite variation process, so is a Dirichlet process. Note that Lemma 5.2 implies 
that the quadratic covariation between any two Dirichlet processes is well 
defined. The reason for our interest in Dirichlet processes is the following 
decomposition. 
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Lemma 5.4. Let X be a semimartingale such that X% = sup s<t |A" s | is 
locally integrable, and f £ V. Then, there exists a unique decomposition 

(5.3) f(t,X t )=M t + A t 

where M is a local martingale and A is a previsible z.c.q.v. process with 
A = 0. 

Proof. First, by Theorem 1.4 of [6], f(t, Xf) is a Dirichlet process. As f(t, x) 
is Lipschitz continuous in x, sup s<t \f(s, X s )\ is locally integrable, so decom- 
position (5.3) follows from Lemma 1.6 of [6]. □ 

As every cadlag process with locally finite variation also has zero con- 
tinuous quadratic variation, decompositions (2.2) and (5.3) coincide in the 
case that / 6 V{X). We shall use the process A in decomposition (5.3) as 
a generalization of the drift of f(t,Xt). 

It will be necessary to integrate with respect to to Dirichlet processes. 
It is possible to do this by taking limits over partitions in a similar way 
to the definition of the quadratic covariation above. However, whatever 
definition is used, by analogy with the case where Y is a semimartingale the 
following integration by parts formula would be expected to hold. For any 
semimartingale X and adapted Dirichlet process Y, 

(5.4) X t Y t = X Y + I X s ^ dY s + I y s _ dX s + [X, Y] t . 

Jo Jo 

This can be taken as the definition of Jq X s _ dY S: as all the remaining terms 
are already defined. Then, the jumps of U = J X s _ dY s satisfy 

AU t = X t _AY t , 

which follows directly from (5.4). In the case where A is a previsible z.c.q.v. 
process and X is a semimartingale, combining equations (5.4) and (5.2) gives 

(5.5) X t A t = X Aq + I X s _ dA s + f A s dX s . 

Jo Jo 

Furthermore, the jumps of B = J X s _ dA s are ABt = Xt-AAt , which 
is previsible. So B will also be previsible. In fact, it will also have zero 
continuous quadratic variation as the following result shows. 

Lemma 5.5. Let X be a semimartingale and V be a z.c.q.v. process. Then, 
J X s _ dV s also has zero continuous quadratic variation. 

Proof. We make use of the result that if / : R — ► R is continuously differen- 
tiable and Z = X + V then 

f(Z t )- f f(Z s J)dX s 
Jo 
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has zero continuous quadratic variation. This is a special case of Theorem 
2.1 of [l ]. Applying this result with f(x) = x 2 /2 shows that 



(5.6) 



i r l 

-{X t + V t f- / (X s „ + V s -)dX s 
1 Jo 



has zero continuous quadratic variation. Substituting X = and then V = 
into (5.6) shows that V 2 /2 and Xf /2— A s _ dX s are also z.c.q.v. processes. 
Subtract these from (5.6) to get the z.c.q.v. process 

X t V t - f V s _ dX s = [ A s _ dV s + X V + [X, V] t . 
Jo Jo 

As XqVq + [X, V]t locally has finite variation, it follows that J A s _ dV s has 
zero continuous quadratic variation. □ 

The associativity of stochastic integration also extends to integration 
with respect to Dirichlet processes. 

Lemma 5.6. Let X be a Dirichlet process, Y, Z be semimartingales, and 
U t = Jq Y s - dX s . Then, 

(5.7) f Z s _dU s = [ Z s -Y s _dX s . 

Jo Jo 

Proof. If A is a semimartingale, identity (5.7) follows from standard rules 
of stochastic calculus. So, by linearity, we may restrict to the case where X 
is a z.c.q.v. process with Aq = 0. For brevity, define X Y to be the integral 



(X-Y) t = I X s -dY s , 
Jo 



which is well defined whenever A, Y are semimartingales, or when one is a 
semimartingale and the other is an adapted z.c.q.v. process. Then, by the 
normal rules of stochastic integration, X-(Y-Z) = (XY)-Z whenever Z is a 
semimartingale. We will also write X ~ Y to mean that A — Y is a locally 
finite variation process with zero continuous part (i.e., a pure jump process). 
So X = Y if and only if A ~ Y, AA = AY and A = Y . Then, 

Z-(Y-X) ~ Z(Y-X) - (Y-X)-Z 

= ZYX - Z(X-Y + [X,Y]) - (Y-X)-Z 

~ ZYX - (ZX)-Y - (X-Y + [A, Y\\Z - [Z, X-Y] - (Y-X)-Z 
= ZYX - (XZ)-Y - (XY)-Z - X-[Z, Y] 
= ZYX - X-(ZY) 
~ (ZY)-X. 
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Here we have used one integration by parts per line, and dropped any term 
that is just a quadratic covariation involving X or YX , as these will be pure 
jump processes. The terms involving Xq have also been dropped because 
we are restricting to Xq = 0. Finally, 

A(Z-(Y-X)) = Z-A(Y-X) = Z_11AX = A((ZY)-X), 

so Z-(Y-X) = (ZY)-X as required. □ 

This allows Definition 2.4 of to be extended to general / G T>. 

Definition 5.7. Let X be a class (DL) quasimartingale, f G T>, and let 

f(t,X t ) = M t + A t be decomposition (5.3). If 9 £ £>k is such that 9{t,X t ) 
is a semimartingale, then we say that 9~ is fij integrable if and only if 













E 


sup 


f 9-{s,X s J)dA s 




< oo 




.t>0 


Jo 







In that case we define fJ-f{9 ) =E [J 6 (s, cL4 s ] . 

6 Proof of Theorem 3.8 

In Lemma 4.3 it was shown that [ij(9~) = flj (9) for twice continuously 
differentiable functions /. The method that we shall use to extend this result 
to more general functions is to employ integration by parts to exchange the 
roles of 9 and /. The idea is to show that fij and jlj both satisfy the 
same integration by parts formula, which will prove the result when / has 
compact support. The extension to general / € T> will make use of the 
following result. Note that here Definition 5.7 for [ij is being employed, 
which does not require / to be in T>(X). 

Lemma 6.1. Let X be a class (DL) quasimartingale, f G T> and 9 G Pk 
be such that 9(t, Xt) is a semimartingale. If f and 9 have disjoint supports 
then 9~ is fij -integrable and fj,J(9~) = fij(9). 

Proof. Define the increasing sequences of stopping times cr n ,T n by tq = 
and 

an = mf {t > T n -i : 9(s, X s ) + 0} , r n = inf {t > a n : f(s, X s ) ^ 0} 

for n G N. Note that (a n , X an ) is in the support of / and (r n , X Tn ) is in the 
support of 9. If these sequences converge to a finite limit r, then (r, X T -) 
would be in the intersection of the supports of / and 9. However, this 
intersection is empty, so T n and a n increase to infinity. 
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Set Y t = f(t,X t ). Then, the pure jump process V = Y^=i\<r n ,T n [ 
satisfies VY = 0, so integration by parts gives 

I V s . dY s = [V, Y) t - [ y s _ dV s = V(AV S AY S - Y S .AV S ) = V V S .Y S . 

Similarly, if Z t = 9(t,X t ) then (1 — V)Z = 0. Applying the above equality 
together with identity (5.7), 



(6.1) 



I z s .dY s = [ v^rfy s = Vv^ s 

Jo Jo s < t 

= J2Zs-Y s = Y / jf(0,f). 



s<t 



s<t 



For the last equality, we have used the fact that the supports of 9 and / are 
disjoint in Definition 3.6 to get J^-(9,f) = Z S JY S . 

Then, Lemma 4.2 shows that J Z_ dY has integrable variation and, by 
the Doob- Meyer decomposition, can be written as N + B for a cadlag mar- 
tingale N and previsible process B with integrable variation and Bq = 0. If 
Y = M + A is decomposition (5.3) then N — f Z_ dM = J Z_ dA - B is 
a previsible local martingale with zero continuous quadratic variation, and 
so is constant. Therefore, j Z_ dA = B has integrable variation. So 6~ is 
/^-measurable and, 



^ {6 



E\B r 



E 



Z s - dY s 



E 



t>0 



The final equality just follows from Definition 3.7 of ftj(9), noting that 
the M[/,c](0) and fix terms are zero due to the supports of 9 and / being 
disjoint. □ 

We require the following quadratic covariations result from [6] . If X is a 
semimartingale and / 6 T> then the quadratic variation of f(t,X t ) satisfies 

(6.2) [f(;X.)] t = f Us,X s fd[Xr s + Y,^f(s,X s )f. 

J ° s <t 

See [6] Theorem 1.4 and Lemma 1.5. This allows us to prove the following 
integration by parts formula for jlj. 

Lemma 6.2. Let X be a class (DL) quasimartingale and f,g£ D K . Then 
the quadratic covariation of f(t,Xt) and g(t,Xt) exists, has integrable vari- 
ation and, 



(6.3) 



A? G?) + K (/) + E [[/(-, X.),g(; X. )]J = 0. 



7.X 
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Proof. Set Y t = f(t,X t ) and apply Definition 3.7, 

£/(/)= ff(LC 2 -f-c-)d t fdx- Jjif-fdtCdx 

+ [ f f-{t,y) 2 dydnx(t,x) 



(6.4) 



+ E 



E /* ((/ - y *)4" + (** - ^)(4") 2 ) dx 



Substitute (/ ) 2 for 6 in equation (4.4) and subtract from (6.4), 



£/(/) 



(6.5) 



(LC 2 -f~C~)d t fdx--E 



f 2 (t,X t ) 2 d[X] 



+ E 



E [ X \f-Y t )f-dx 
t>o JXt - 



We require the following identity, which is simply an application of integra- 
tion by parts. Here, the notation Atf = f — f~ is used. 

(6.6) J x \f-Y t )f-dx + ^AY t 2 = J (f i2 l {x<Xt} -f-l {x<Xt _ } )A t fdx. 

Note that, as f(t,x) is cadlag in t, there can be only countably many times 
t > at which f(t,x) ^ f~(t,x). So, summing the right hand side of (6.6) 
over all t > gives 

/ 1 (f,2 1 {x<x t } - f^{x<x t -}) dtfdx. 

As f j2 has bounded support, this integral is bounded. Then, sum (6.6) over 
t > and take expectations, 



E 



Here, we have substituted in — C 2 and — C 2 for P (X t > x) and P (X t - > x) 
respectively. This equality can be added to equation (6.5) to get 



fif (/) = -^E 



[Moo] • 



Here, equality (6.2) has been used. This proves the result in the case where 
/ = g. The general result then follows from the linearity of (6.3) in both / 
and g. □ 
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This integration by parts formula can be used to exchange the roles of 9 
and / in the expression for fij(9~). 

Lemma 6.3. Let X be a class (DL) quasimartingale, and f,9 G T>k- If 
9 G D(X) then 9~ is -integrable. If, furthermore, fi(f(f~) = Ag^Cf) then 
»f(9-) = jlf(9). 

Proof. Set Ut = f(t,X t ) and Vt = 9(t,X t ) and use the decompositions 
U = M -\- A, V = N + B where M, N are local martingales and A, B are 
previsible z.c.q.v. processes. Integration by parts gives 

(6.7) I U s - dB s + I V s - dA s + [U, V] t = U t V t - L t 

Jo Jo 

where L is the local martingale 

Lt = I U s . dN s + I V s . dM s . 
Jo Jo 

However, by equation (5.2), [M, A] and [N, B] are local martingales, so 
E [[U]] = E [[M] + [A]] and E [[V]] = E [[N] + [B]]. As Lemma 6.2 says that 
[U] and [V] are integrable, it follows that [M] and [N] are integrable, so 
M and are square integrable martingales. Therefore L is also a square 
integrable martingale. In particular, it is a martingale. If 9 is in T){X) then 
j U- dB has integrable variation, and it follows from (6.7) that 



sup 

t>o 



t 

V,- dA f 



is integrable, so 9 is ^-integrable. Take expectations of (6.7) and substi- 
tute in the definitions of [if , to get 

^(f-) + vf(9-) + E[[U,VU=0. 

Combine this with equation (6.3) to eliminate the quadratic covariation 
term, 

and the result follows. □ 

Lemma 6.3 can be applied to calculate fJ-f(9) for general / G V, in the 
case where 9 is twice continuously differentiable. 

Lemma 6.4. Let X be a class (DL) quasimartingale and f G £>. If 9 G T>k 

is twice continuously differentiable then 9 is fij -integrable and /J.J{9) = 
fif(9). 
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Proof. By choosing any g 6 T>k such that g = 1 on an open set containing 
the support of 9, and writing f = gf + (1 — g)f , then the linearity in / 
means that we only need to prove the result separately for the cases where 
/ 6 T>k and where / and 9 have disjoint support. 

First, suppose that / 6 2?k- Then Lemma 4.3 says that 9 G V (X) and 
nf(f~) = jiff if), and Lemma 6.3 gives the required result. Finally, if / 
and 9 have disjoint support then the result follows from Lemma 6.1. □ 

In order to prove Theorem 3.8 we just need to extend the previous re- 
sult to non-differentiable 9. The idea is simple enough - just smooth 9 by 
convolving with smooth functions and take limits of jlj(6). However, it is 
rather tricky to show that these limits do in fact converge to the desired 
limits. We do this by smoothing 9(t, x) separately in t and x. 

Proof Of Theorem 3.8. First suppose that 9(t, x) is twice differentiable with 
respect to x and 9 t2 ,9 )22 £ X>k- Then, choose any twice continuously differ- 
entiable a : (0, 1) — > K with compact support and whose integral is equal to 
1. Define ^ 

9 n (t,x)= / 9{t- s/n,x)a{s)ds. 



o 



For n large enough such that the support of 9 is contained in (1/re, oo) x R, 
it follows that 9 n S £>k is twice continuously differentiable, so Lemma 6.4 
gives fif(9 n ) = fif(9 n ). As 9 n -> 6~, dominated convergence implies that 
[ij(9 n ) converges to /j,J(9~). 

We now show that flj(9 n ) converges to £ij(9). As {9 n ) )2 converges to 9~, 
dominated convergence implies that the drift and jump terms in Definition 
3.7 do indeed converge. The remaining term is 



fjf* c * d ^ n dx ~ If dtC dx ~ II dtf dx - 



g(t + s/n,x) ds dt9 a dx 



Again by dominated convergence, the last two terms on the right hand side 
will converge. We look at the first term, which can be rearranged to get 

(6.8) J J f a C 2 d t 9 n dx = J! g d t 9 n 2 dx = J J J 
where 

f'OO f'OC 

g(t,x)= f t2 (t,y)C 2 (t,y)dy = - f 2 (t, y)P (X t > y) dy 

(6.9) Jx Jx 

= E[l {Xt>x} (f(t,x)-f(t,X t ))]. 

As / is Lipschitz continuous in x and Xt is right-continuous and uniformly 
integrable on bounded intervals, it follows that g(t, x) is right-continuous in 
t. So, taking limits in (6.8) and applying dominated convergence, 

f 2 C 2 d t 9 n dx^ (I gd t 9 2 dx= [[ f 2 C 2 d t 9dx 
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as n — > oo. Therefore, jij(9 n ) = ftj(9 n ) converges to £lJ(9), and it follows 
that nf(9~) = fif(9). 

Now choose any 9 € £>k and twice continuously differentiable a : M — > R 
with compact support and which integrates to 1. Set 

(6.10) 9 n (t, x) = j 9(t, x + y/n)a(y) dy. 

Then, 9 n (t,x) is twice differentiable in x and 9 n , 9™ and 9™ 2 are all in 
Z?K- The argument above shows that fif((9 n )~) = flj(9 n ). Choose any 
(t,x) € x 1R such that f(t,x) and C(t,x) are differentiable in x. Then, 
P (Xt = x) = and if g is defined by equation (6.9) it follows that g(t, x) is 
differentiable with respect to x and, 

g a (t,x) = E [l{x t >x}f,2(ti x )] = -f,3(^ x ) c ,2(^ x )- 

By Lemma 2.2, f(t,x) and C(t,x) are differentiable in x almost everywhere 
with respect to the measure JJ ■ \df9\dx, and dominated convergence gives 



LC 2 d t 9 n dx 



g 2 (t,x - y/n)dyd t 9dx -> J J f a C 2 d t 9dx 



as n — > oo. Also, from the definition (6.10) above for 9 n it follows that 
9^(t—,x) converges to 9 t2 (t—,x) as n — > oo whenever 9~(t,x) is differ- 
entiable with respect to x. So, as above, dominated convergence implies 
that nf ((9 n )~) tends to fif (9~) and flf{9 n ) tends to /if (0). So, (if (9) = 
fij(9~) as required. □ 



7 The Time Reversed Conditional Variation 

In the previous section the expression for the drift of f(t, Xt) given by Theo- 
rem 3.8 was proved, under the condition that / is in D(X). However, it still 
remains to show that the conditions specified in Theorem 2.6 are sufficient 
for / to be in T>(X). This direction is complicated by the problem that 
f(t,Xt) cannot be assumed a priori to be a semimartingale. The idea is to 
bound the conditional variation, and show that f(t, Xt) is locally a quasi- 
martingale. Then, Rao's decomposition can be used to decompose it into 
local martingale and finite variation terms. Unfortunately, the conditions 
on / are too weak for this to be an easy route, and instead we will bound 
the conditional variation of the drift term A from decomposition (5.3) un- 
der time reversal. This will be enough to show that A is a finite variation 
process and, consequently, f(t,Xt) will be a semimartingale. The following 
time reversed natural filtration of X will be used. 

gf = a {{X s : s > t} U {A G T : P (A) = 0}) . 

Note that for times s < t, then D . The conditional variation with 
respect to this filtration is defined as follows. 



A GENERALIZED BACKWARD EQUATION 



25 



Definition 7.1. Let X, A be real valued processes such that E [\At\] < oo for 
every t > 0. Define 



V X {A) =supE 



Y,Z k (A t 



where the supremum is taken over all sequences < to < t± < ■ ■ ■ < t n and 
all -measurable random variables \Zy\ < 1. 

In the current section we show that bounding the time reversed condi- 
tional variation V X (A) is enough to bound the variation of A. Then, in the 
following section, this will be applied to the proofs of Theorems 2.5 and 2.6. 
The main result used is the following. 

Lemma 7.2. Let X be a cadlag real valued process and A be a z.c.q.v. 

process such that sup t>0 \ A t \ is integrable and V X (A) < oo. 

Suppose furthermore that A t — A s is -measurable for all t > s and 
that there exists a measurable u : R+ x R — > R such that A At = u(t,Xt) for 
all t > 0. Then, A has integrable variation satisfying 



E 



\dA\ 



<V r x {A). 



Proof. Pick any T > and let T T be the filtration and B be the cadlag 
process defined as 



r t 



B t = A T _ 



A 



(T-ty 



In order that this makes sense for t > T, we set J~t = and At = Aq 
for t < 0. The condition that At — A s is (/^--measurable for all t > s 
ensures that B is ^-adapted. By the condition of the lemma, ABt = 
AAx-t = u(T — t, Xx-t) is a measurable function of the left-continuous and 
^-adapted process Xx-t- So, AB and hence B are .F. r -previsible. 

Now choose any < to < t\ < ■ ■ ■ < t n and random variables \Zk\ < 1 
such that Zk is T\ k -measurable. Setting = T — t n _h — a for any real 
a > 0, 



B tk - B tk _ x = lim (A„« -A 



Ft k -i 
gence gives 



- , so Zfc is Gs<* ■ 

n + l — k n + l — k 



n + l-k 

measurable, and dominated conver- 



n n 



k=l 



k=l 



So B is an ^'-quasimartingale with conditional variation bounded by V X (A), 
and Rao's decomposition (Lemma 3.4) can be applied, 



(7.1) 



B t = N t + C t . 
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Here, N is an IF!" -local martingale and C is a cadlag ^T-previsible process 
with 



E 



\dC\ 



< VJc(A). 



However, B and C, and therefore N, are ^"-previsible processes, so A" must 
be continuous. Also, as A is a z.c.q.v. process, it follows that N + C and 
therefore N also have zero continuous quadratic variation. Furthermore, the 
continuity of N implies that [N] = and, as A 7 " is a martingale it must be 
constant. So, 



A t = A T . 
for t < T. Finally, 

E 



B 



(T-ty 



A T - 



\dA\ 



o 



r rT 



E 



\dC\ 



(T-ty 



< V r x {A). 



The result follows by letting T increase to infinity. 



□ 



The remainder of this section will be used to show that the conditions 
required for Lemma 7.2 to be applied are satisfied in the cases we need, 
starting with the following result which expresses A A as a function of X. 

Lemma 7.3. Let X be an adapted and quasi-left- continuous process and 
f 6 T>. Suppose that decomposition (5.3) exists, and is f(t,Xt) = Mt + At. 
Then, 

AA t = f(t,X t )-f(t-,X t ) 

for every t > 0. 

Proof. First, choose any previsible stopping time r > 0. The quasi-left- 
continuity of X gives X T = X T -. As A is previsible, E[A^4 r |^v_] = AA T 
and, as M is a local martingale, E [AM r |jF r _] = 0. So, 

A^l r = E[f(r,X T ) - f{T-,X T _)\F T _] = /(r,X T _) - /(r- X T _). 

As X is adapted, both sides of this equality are previsible. So, by previs- 
ible section, it remains true for all times r > 0. Now, as f(t, x) is jointly 
continuous in x and cadlag in t, there exists a countable Scl such that 
f(t,x) = f~(t,x) for all t $ S. Also, the quasi-left-continuity of X implies 
that it is continuous in probability, so AXt = for every t G S. Then, 

AA t = l {tes} (/(i,AV) - f{t-,X t _)) = l {tes} (f(t,X t ) - f(t-,X t )) 
= f(t,X t )-f(t-,X t ) 



as required. 



□ 
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Another condition required by Lemma 7.2 is that At — A s should be 
Q x -measurable whenever t > s. This will follow by applying the following 
lemma to decomposition (5.3). 

Lemma 7.4. Let X be a Markov process and Y = M + A where M is a 
local martingale and A is a previsible z.c.q.v. process with Aq = 0. Then, Y 
is adapted to the natural filtration, T x , of X if and only if A and M are 
T x -adapted. 



Proof. If M and A are .T-^-adapted then clearly Y is also J-^-adapted. Con- 
versely, suppose that Y is .F x -adapted. For the moment, let us suppose that 
[Y]oo is integrable. As identity (5.2) says that [M, A] is a local martingale, 
monotone convergence gives E [[M]oo + [A]oo] = E [[^]oo] < oo. So, M is a 
uniformly square integrable martingale. 

Now let N be the cadlag process N t = E [M t \F x ] ■ As M is adapted, the 
Markov property for X gives Nt = E [Mt|.?£ ] , so is a uniformly square 
integrable -martingale. Set Y = Y — N and M = M — N . 

If, for every n £ N, we let P n be the partition of R given by r^ n = k/n 
then, letting [M] Pn = [M,M] Pn be the approximation to the quadratic 
variation of M given by (5.1), 



[M]f* 



[M]t 



t 00 

Eh 

k=l 



\<S<T^}( M S 



T k-1 



dM, 



is a local martingale. So, by monotone convergence, the expected values of 
the sequence [M]f n are bounded by the expected value of [M]t- Therefore, 
([Mjf™) 1 / 2 is uniformly integrable over all n € N. As the Cauchy-Schwarz 



inequality implies that |[Y,M]f n | is bounded by ([Y] 
take limits 



Pn 



[M]f") 1/2 



we can 



E 



lim E 

n— »oo 



[Y,M]f"|^ c 



lim y^(Yp n - Yp n 



iE 



k=l 



M t^M 



Furthermore, [A, M] is a local martingale with integrable variation bounded 
by ([Alool-Mloo) 1 / 2 ) so it is a proper martingale. Therefore, 



E 



E 



E 



[A,M] t 



0, 



so, M = N and A = Y - N are J" x -adapted. 

Now, returning to the general case, we can define stopping times 



T m = mf{t£R + :[Y] t >m}. 
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Letting Y t Tm ~ be the pre-stopped process equal to Y t for t < r m and Y Tm - 
for t > r m , it follows that [Y rm ~]oo < m. 

As Y is a sum of a local martingale and a cadlag previsible (hence lo- 
cally bounded) process, Y t * = sup s<t \Y s \ will be locally integrable. So, 
Z m = l[ Tm 00 [Ay rm has locally integrable variation, and the Doob-Meyer 
decomposition gives Z m = N m + B m for a local martingale N and cadlag 
previsible process B m with locally integrable variation, and we can take 
N™ = Bq 1 = 0. Then, the argument above can be applied to Y Tm ~ = 
(M Tm - N m ) + (A Tm - B m ) to see that A Tm - B m is ^-adapted. 

Finally, choosing any t > and stopping time a such that Y* is inte- 
grable, 

" rtAcr 

E[l {t<a} \B™\] <E / \dB m \ <E[l {tACT > Tm} |Ar rm |] 
Jo 

< 2E [i {t > Tm} Y;] - 

as m — > oo. As y* is locally integrable, P (cr < t) can be made as small 
as we like, showing that B™ — > in probability as m —> oo. So At = 
lim m _ >00 (^[ m - B^) is JT x -adapted. □ 

Finally for this section, we apply the previous result to provide a suf- 
ficient condition for At — A s to be (y^-measurable, as required by Lemma 
7.2. 

Corollary 7.5. Let X be a Markov process and Y = M + A where M is 
a local martingale and A is a cadlag previsible process with zero continuous 
quadratic variation. If Y t — Y s is Qf -measurable for every t > s then the 
same holds for A. 

Proof. Fix any s > 0, and set 

Xt = l{t>s}Xt, Y = l{ t >s}(Y t - Ys), 

M t = l{t>s}(M t - M s ), A t = l {t > s} {A t - A s ). 

Applying Lemma 7.4 to Y = M + A, we see that A is J-'.^-adapted. In 
particular, for any t > s, A t — A s = A t is measurable with respect to 
F?QG?. □ 

8 Proof of Theorems 2.5 and 2.6 

We now apply Lemma 7.2 to the proof of Theorem 2.5. 

Proof of Theorem 2. 5. Choose any bounded and measurable 9 : R+ xK^ 
K. with bounded support and set 



B t = f 0(s,X s _)dA s . 
Jo 
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We will show that the conditions required to apply Lemma 7.2 to B are 
satisfied. First, as / € T>(X), B has integrable variation. In particular, it 
has zero continuous quadratic variation and is integrable. Lemma 7.5 shows 
that At — A s is (/^"-measurable for every t > s, so Bt — B s will also be Q*- 
measurable. The quasi-left-continuity of X implies that Xt = Xt- whenever 
AA 0. So, applying Lemma 7.3, 



AB t = 9(t, X t )AA t = 9(t, X t )(f(t, X t ) - f(t-,X t )). 

We now bound VJ^(B). Choose any T > and measurable g : I 
Then, as X is strong Markov there exists a measurable h : x 
such that 

h(T,X T )=M[g(X T )\F T ] 



[-1,1]. 
►[-1,1] 



for every stopping time r <T. For every t <T, optional projection and the 
quasi-left-continuity of X gives, 



E[g(X T )(B T -B t )]=E 



j\(s,X s _)9(s,X s _)dA s 



vf(l(t,T]h9) < |/4|(l(t,T]|0|)- 



x 



So, Definition 7.1 gives V^iB) < and, finally, Lemma 7.2 can be 

applied, 

1 < 



E 



Xt 



\dA t 



E 



\dB\ 



as required. 



□ 



The proof of Theorem 2.6 will require the following result, generalizing 
the expression for {if (9) to non-smooth 9 and all / in T>. The proof follows 
in the same way as for Lemma 6.4. 

Lemma 8.1. Let X be a class (DL) quasimartingale and f € "D. If 9 £ £>k 
and 9 € T>(X), then 9~ is [ij -integrable and fij (9~) = flj (9). 

Proof. As in the proof of Lemma 6.4, the linearity in / means that it is 
enough to prove the result separately for the cases where / € T>k and where 
/ and 9 have disjoint support. First, if / € £>k, Theorem 3.8 gives nf (f~) = 
p,(f(f). Then, the result follows from Lemma 6.3. Finally, if / and 9 have 
disjoint support, the result is given by Lemma 6.1. □ 

We also need to know when the product of functions /, g in T>{X) is itself 
in T>(X). This will be true whenever fg is Lipschitz continuous, although 
we only require the case where either / or g has compact support. 



Lemma 8.2. Let X be a class (DL) quasimartingale and f,g £ D(X). If 
either f or g is in Z?k then fg € T>(X). 
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Proof. Without loss of generality, suppose that g G £>k- As with the proofs 
of Lemmas 6.4 and 8.1, the linearity in / means that it is enough to prove the 
result separately for the cases where /, g have disjoint supports and where 

If they have disjoint support then fg = 0G V(X). If / G 2?k then, 
setting ft = f(t,X t ) and gt = g(t,X t ), Lemma 6.3 says that the quadratic 
covariation [/, g] has integrable variation. If ft = Mt + At and gt = Mt + At 
are decomposition (2.2) then integration by parts, 



ftgt 



\ fs- dg s + / g s - df s + [/, g] t 
Jo Jo 

= [ f s -dM s + [ g s -dM s + [ fs-dA s + [ g s -dA s + [f,g] t , 
Jo Jo Jo Jo 

shows that ftgt decomposes into local martingale and integrable variation 
terms, so fg G V(X). □ 

We finish with the proof of Theorem 2.6, which uses Lemma 7.2 together 
with integrals of z.c.q.v. processes studied in Section 5. 

Lemma 2.8 will also be required. If X is a continuous and strong Markov 
martingale, T > and g : M. — > K is convex and Lipschitz continuous then 
this says that there exists an h 6 T> such that 

(8.1) h(t,X t )=E[g(X T )\Ft} 

for every t < T. By linearity, this remains true when g is a difference of 
convex and Lipschitz continuous functions. If, furthermore, \g\ < 1 then h 
can be replaced by {h V — 1) A 1, which will also be in T>. 

Proof Of Theorem 2.6. First, if / £ 'D(X) then the result follows from Theo- 
rem 3.8. Conversely, suppose that there is a local signed measure /x satisfying 
= for all 8 G D K . As X is continuous and a martingale, this 

says that fif(0) = /i(#~). Let f(t,X t ) = M t + A t be decomposition (5.3) 
and choose any twice continuously differentiable 9 G 2?k- Set 



Jo 



B t = / e(s,X s )dA 



s ■ 



We shall show that the necessary conditions for B are satisfied in order for 
Lemma 7.2 to be applied. First, by Lemma 5.5, B is a previsible z.c.q.v. 
process. Also, Lemma 7.5 says that At — A s is t?^-measurable for every 
t > s, and it follows that Bt — B s is t/^-measurable. Furthermore, Lemma 
7.3 gives 



AB t = 6(t,Xt)AA t = 9(t,Xt)(f(t,X t ) - r(t,X t )). 
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It just remains to bound VJ^(B). Pick any T > let g : R — > [— 1, 1] be a 
difference of Lipschitz continuous convex functions. Then, there exists an 
h € T> such that \h\ < 1 and (8.1) is satisfied. Setting Nt = h(t,Xt), 

(8.2) h{t,X t ){B t -B s ) = f h~{u,X u )9{u,X u )dA u + f B u dN u . 

J s J s 

for all t > s in the interval [0, T). Here, identity (5.7) and the integration 
by parts formula (5.5) have been applied. The final term on the right hand 
side is a local martingale (with parameter t £ [s,T)). As Lemma 8.2 says 
that h9 € T>(X), we can take expectations of (8.2) and apply Lemma 8.1 to 
get 

E[g(X T )(B t -B s )] =jif(l la>t) M) = l i(l M h-6) < M(1( S> #D- 

Letting T decrease to t then gives 

E[g(X t )(B t -B s )] < |m|(1 M |0|) 

for all t > s > 0. By the monotone class lemma, this extends to all mea- 
surable g : R — > [— 1, l], so for any C/^-measurable random variable \Z\ < 1, 
the Markov property for X implies that 

E [Z(B t - B a )\ = E [E \Z\X t \ (B t -B.)] < \fi\(l {s ,t]\0\). 

Putting this into Definition 7.1 of V£(B), shows that V r x {B) < \fi\(\9\). 
Therefore Lemma 7.2 can be applied, so B has integrable variation, with 
expectation bounded by So, 

<M(|0|)<oo. 

By monotone convergence, this extends to all continuous 9 with bounded 
support. So A satisfies the requirements of decomposition (2.2), and / is in 
V(X). □ 
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